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The requirenients for a theory of instruction can be 
described in the following list of criteria: 1) a model of the 
learning process; 2) specification of adrnlsslble instructional 
actions; 3) specification of instructional objectives; a) a 
measurement scale that permits costs to be assigned to each of the 
instructional actions and payoffs to the achievement of instructional 
objectivas, from these four elements it is possible to derive optimal 
instructional strategies. A theory of instruction is^ in fact, a 
special case of what has come to be known in mathematical and 
engineering literature as optimal control theory. Precisely the same 
problems are posed in the area of instruction except that the system 
to be controlled is the human learner. To the extent that the above 
four criteria can be formulated explicitly^ methods of the control 
theory can be used to derive optimal instructional strategies. Two 
examples involving the derivation of such strategies are considered 
in this paper; a computer-assisted instruction program for early 
reading and an individualized program for learning a foreign language 
vocabulary. Both are analyzed on control theoretic principles, 
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the following li^t uf criteria- 1) a model of the learning Drocesu; ^) Bpeclfica^ 
tion of ediniB^lble instructional actions; 3) Lipecif ication of Instructional objea- 
tiveG; k) a nieasu rement 3cale that permits co^^ts to he assigned to ^ach of the' 
instructional act ions and payoffs to tha achievement of instructional objectives. 
If these four elements can bo given a precise inteipztf xation^ then in general it 
Is possible \.G derive optljnal instnictignal nLrategieG, In terms of these criteria 
it is clear that a theory of instruction is^ in fact^ a special case of what has 
come to be known In the mathematical and engineering literature as optimal control 
tteoi^. Precisely tte sajna probleins are posed in the area of instnictfon^xcept 
that the system to be controlled is the hAiinan learner^ rather than a machine or a 
group of industries. To the extent that the above four criteria can be fomulated 
explicitly, methods of the control theory can be used to derive optimal instraC'^ 
tional strategies. Two examples invplving the derivation of optimal atrategie^c^ are 
considered In this paper. One deals with the development of a computer-assisted 
instruction prograi7i for teaching Initial reading in the early grades; the Recond 
example deals with learning a foreign-language vocabulaiy. In both ca^es, analyses 
based on cCiiLrol theoretic principles proved to be highly ndvantageous , 
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iNGREDIENTS FOR A THEORY UF INSTRUCTION^ 
Richard C. Atkinson'^ 
St:anf or d University 

The term ''theory of i f;ruct i on" has been in widGaprGad use for over a 
decade and during that time haa acquired a fairly specific meaning. By 
consensus it denotes a body of theory concerned with optimising the laarning 
process; stated otherwise^ the goal of a theory of Instruction is to pre- 
scribe the most effective methods for acquiring new information, whether in 
the form of higher-border concepts or rote facts. Although usage of the term 
is widaspre^d, there is no agr^^ment on the requirements for a theory of 
instruction. The literature provides an array of examples ranging from 
speculative accounts of how children shouLd be taught in the classroom tc 
formal mathematical mod^ 13 iipr-cifylng precis^ uranching prncfcidurc in 
computer-^conrrolled ins true ti on . ^ Such diversity is healthy; to locus on 
only one approach would not be productive in the long run, I prefei: to ur.e 
the term "theory of instruction" lo encompass both axperimental and 
theoretical research, with the theoretical work ranging from general 
spaculative accounts to specific quantitative models. 

The literature on instructional theory is growing at a rapid rate. So 
much so that, at this point, a significant contribution could be made by^ 
someone willing to write a book summarizing and evaluating work in the area. 
I am reminded here of Hilgard*s book. Theories of Learning first publisher: 
in 1948; it played an important role in the development of learning theory 
by effectively summarizing alternative approaches and placing them in 
perspective. A book of this type is needed now in the area of Instruction. 
My intention in this paper is no present an overview of one of the^ chapters 
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that I would like to hl^q includt^d .in nucl, a book; a niLiu' for c'm: wh^iDter 
might be "A decls ion-^theore t ic analvsis of i t r \ic 1 1 on Basicallv, f 
shall consider thp facL.^ry that i.eb-a to be oxamlriyd i.i Juiy.iug optirmj 
instruccional scrategieB and then usu this analysis to identify t:h^ kay 
elements of a theory or instruction. 

A decision-theorjlTic analysis of INBTRUHTIO^ 

The derivation of an optimal ytrattigy requircss thaL the i ns t r uc t iona I. 
problem ba stated in a form amenable to a decis ion- cheoret i c analyBly. 
Analyses based on decision tht^ury vary tiomtfwhat Ironi I'Leld to field, but 
the same formal elemeuth= can by found in most of them* As a sta^tlne point 
it will be useful to idcntiny Lh^rHe eiemgnts in a gcineral w^y , and rh^ni 
relate them to an ins r-uv Inn.- ; blLuation. They are a:^ frllavj\: 

1, Thy pOayxDla b.al^^i CL imLur^s 

2, The actions iliat: the ilyciaion-inaktir can take to transtorni zhe 
state of nature. 

3, The transformation of the state of nature that results from each 
action, 

4* The cost of each action* 

5, The return resulting from each state of aature. 
In the context of instfuction, these elements divide naturally into three 
groups* Elements 1 and 3 are concerned with a description of the learning 
process; elements 4 and 5 specify the cost-^benef it dimensions of r.he problem; 
and element 2 requires that the instructional actions from which the decision 
maker is free to chose be precisely specified. 
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require r:hat a model of the JBarning pvocBus exl^i. I- uauaiiy natural 
to IdenLify lh^^ staCes oi ^.hturo wicii the ..y^dr'uing ataLcri cl lu- sLuddnL. 
Specifying the Cran^iormaLion of the states of nacure caused by the actions 
of the decision-maker tantamount Co coni. c rue t Lng a m^u]^i or learning for 
the sicuation under con^ id^i^-at ion. The learning model ullj he prohnh i 1 1 s t i.a 
to rhe extent chat the Atat(^ of iMrning impertectlv nhHurvH'ile or the 
transformation of the state of learning that a given Lni^cruct: lonal action 
will cause is not coinpletely predictable. 

The specification of cofic^^ and returns in an ins true 1 1 o/al t^Ltuation 
(elernttnt^ 4 and 3) tendrf to ba iitraignr forward wh^^n exainined on a m^ort-^K^rm 
basis, but virtually inLracLable over the long-:-erm. .^or cii^ Hnn- t:er :n 
one cai* a.^.ri^n c.i.^7:.> ^n,! ^i-n,, y^,, !na-;t^ :v r.- . :i.y. , y^-.^- ^ I . ? ; 

reading skills, aut soph nicar^.d (U tarm Lnarion.^ ^ s.MV;-Nn-n => 

of th.ase aiciilB cu Unu imliviauai and aociery aro dlfilcuj: rn make, Vlrirf^ 
is an important role for derailed economic analy^eB of che lang-i:erm (npn ;:: 
of education, but such st:udies deal with issues at a more global level tnan 
we shall consider here. The present analysis will be limited to those 
costs and returns directly related to a specific instruatlonal task, 

Element 2 is criLinal in determining tae ef f ectivenass of a decib^on- 
theory analysis; the natire of this element can be indicated by an example. 
Suppose we want to design a supplementary set of exercises for an Initial 
reading program that involve both sight-word identification and phonics. 
Let us assume that two exercise formats have been developed, one for tr^^^^i.^q 
on sight words, the other for phonics. Given nh.-^f^ foi^aatis, i:hf?re are inan\ 
ways to design an overall progrMi. A variety of optimizarion problfc 
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can bd [?f£nar^ ::t2d rixliv? Ham^i .it-aLur^.i . : uhi5 rr Icii L\ii and haavinj ncM^r:- 
to be dfiC^rmLni^d in a uataor n r: Lcailv o-)^ trnat manvier . -''or ^;<ainDi.<3. - r ma^/ 
136 deB:orai)U to dfifarrnini^ nnv; uha Ciiia a\/aJ uaiUe inr i a-4i:r k^moi^ H'hiind b^^ 
divided between phdnicH and Ri^^hu word racognicion, wirh all atlia.; f^nturos 
of: the .:urrsculUTn fixed A more nompl nueMt lCh wntiin be in ri G::..?rn- . .^(^ 

riie ODnlma L order'nc ^.he ^-\;m ^vcrs nl exerciis*-^? in .^ddit:ff^n lm Hu. nr>t:lma] 
allonatJ<?u of timn. It ^^on Ui b - ean;^^ >'U .oriMnn^* ^r.^n.-^r - 1 1 nR HLff'^^r^'^^ 
optimisation probleins in this manner, The main poini: ii? chat varying 
set of actions from winch the deci -i f on-maktir is nr^o uc cho-Re rh-^n^^ n r^?^ 
decision problein, yven though the orhfir elerru^nrs remain tUo sam. . 

Once theBe fi'.e elemenr?; have hePii f^nr^ci f i ^?d , the jiext: va*^k 1?? 
deriv€: the optimal Btratogy rut the jearning marU?] Lha?; h,.:t^r desr^MI:-^i ih^^ 
^iCMat^nn, ;r Tncj:e h^n mh.; !;.?:mm>.? .ii^f^::.^ ^i^^'i^r ^^-'i^ a ■■■ ■ ^ - 

Liompetini^ candiilateri tar cnc.'i 4. sL:rateRV cau he i^irh*^.,^d. Wh m= ^Mnt-^- 

caakB havv^ been ac-.omp i i.ihed , an exot^ilmen'i CLn hi. das.Hne:! de'err" ne 
which strategy be^^t. There are F^^v^ral possli5le dlrenUonB wh\ m 
proceed after the inirial comparison of Htrategies, depending on the : u r^u. 
of Cha experiment. If n-ne of the Hupposadly oprimal ^rrhiui^^i-^^^ jrodur : - 
':?atisf actory results, then further exr^ar imen '.al avBlv^iy. the afi^^rrr^^*- - 
of the underlying learning moHei^ is indicated. New ^ssneB ma^ pri^^ -ver 
if one of the procedures is successful. tii the ^^econd exampie tiiar we whall 
discuss, the successful strategy produces an unusually high error ra*e dur^ ,; 
learnings which Is contrary to ^a widely accepted principle of pvowmmM 
Instruction (Skinner, 1968). When anomalias such as this occur, thev 
suggeBt new iine^y of exper iniental Inouiry . and o^'ter re..iu3rH a rtsf crmulac?e. 
of the learning raodRl-'^ 
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CRITERIA FOR A IliKORi OF LNSTRUCTION 
Our discussion to this point can be .summarized by listing four criteria 
that must be satisfied prior to the derivation of an optimal instructional 
strategy : 

1* A model of the learning process* 

2. Specification of admissible instructional actionK. 

3. Specification of instructional objectives, 

4. A measurement scale that permits costs to be assigned to each 
of the instructional actions and payoffs to the achievement at 
ins tr uc tionai obj !=^c t ives . 

If thi^st. L^ui: £-ae.uenL- ^.aii be given a urcwise Xnt ai pr^^La L th^in U 1^ 
generaily pussibie lu d..ti:l.c ar. opLiu.d.L ^nsLru.;:LiDnai nu.icy. 11, e r^oi.n • -1 
for ail wpriiu^j poli-.y i.^; ^ui r.,r,;..^.... ^ in...^r.: ^t.^iH ,.cw _ : 

toold hava been duvulQpiau icr ci.LSCQvct ing upiimaj. or near optima.. pruc..uu.i..- 
if they exists 

The four criteria listed above, taken in conjunction with methods for 
deriving optimal strategies, define either a model of instruction or a 
theory of instruction. Whether the term theory or model is used depends on 
the generality of the applications that can be made. Much of my own work 
has been concerned with the development of specific models for specific in-^ 
structional tasks | hopefully, the collection of such models will provide 
the groundwork for a general theory of instruction. 

In terms of the criteria listed above, it is clear t'hat a model or 
theory of instruction is in fact a special case bf whac has come to be 
known In the mathematical and engine^tring literature jptimal control 
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cheMj/ or, more Eimpiy, cmtj oi:. J^1BP:^1 (^laUianj Faib, 6 Atl ibj og^.* tha 
development of control theory has progressed at a rapid rate Doth in the 
United States and abroad , but most oi the applications involvu engineering 
or eGonomlc systems of one type or anoLher, Precisely the same problems 
are posed in the area of instruction except that the system to be cyiiirollad 
is the human learner, rather than a machine or group of industries. To the 
axtent that the above four slements can ba formulated explicitly, mettiods 
of control theory can be used in deriving optimal instructional strategies. 

To make some of these ideas more precise, we shall consider two examples. 
One Involves a responae-insensit Ive sLrategy and the other a response-sentu ti- 
sKatejiY* A response-insensitive strategy orders the instructional mateiJ^dU 
without taking into account the student "S responses (except passibly to prDuln 
viorrectj.ve feedback) as b,\ progrepBes thruu^h tha curricalgm. In ^..nn--H.; . 
a response-sensitive strategy maRe^ use uf the stuaen.^^s cespoase mstory 
in ita stage-by-stage decisigns regarding which currlcuium i.aturials Hr».B.. 
next. Response-insensitive strategies are complete.ly specified in advance 
and consequently do not require a system capable of branching during an 
Instructional session, liesponse^sensitive strategies are moLe compleK, but 
have the greatest promise for producing significant gams for they mucc be 
at least as good, if not better , than the comparable respon^se-insensitive 
strategy. 

OPTIMIZING INSTRUCTION IN INITIAL READING 
The first example is based on work concerned with the development of h 
computer-assisted instruction (CAI) program for teaching reading in tha 
primary grades (Atkinson & Fletcher, 1972), The prcgram prgyidea indlvid^ 
ualized instruction In reading and is uiaed as a suppiemeAt to normal 
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classroom teaching | a given student siay spend anywhere from zero to 30 
minutes per day at a CAI terminal. For present purposes only one set of 
resulta will be considered, where the dependent measure is performance on 
a itandardized reading achievement test- administered at the end of the 
first grade. Using our data a statistical model can be iormulated that 
predicts test performance as a function of the amount of time the student 
spends on the CAI system. Specifically, let P^(t) be student i^s performanc 
on a reading test administered at the and of first grades given that he 
spends time t on the CAI system during the school year. Then within 
certain limits the following equation holds i 

P^(t) - a. - ejexp("Yit) 
Depending on a student's particular parameter values, the more time spent 
on the CAI program the higher the levf:! of achievement at the and of thf^ 
year* The parameters a, and characterize a given student and vary 
from one student to the nent; a ana C^-S) are measures of the .student 's 
maximal and minimal levels of achievement respectively s and y is a rate 
of progress measure. These parameters can be estimated from a student "S 
reaponsa record obtained during his first hour of CAI, Stated otherwise^ 
data from the first hour of CAI can be used to estimate the parameters 
a J 8 1 and y for a given student, and then the above equation enables us to 
predict end-of-year performance as a function of the CAI time allocated to 
that student, 

The optimization preblem that arises in this situation Is as follows^ 
Let ui suppose that a school has budgeted a flKed . inount of time T on thtf 
CAI system for the school year and must decide how to allocate the time 
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ajnong a class of n firat-grade students. Assume, further, that ail student.^ 
have had a preliminary run on the CAl system so that estlmaces of the 
parameters a, g, and y have been obtained for each student. 

Let t£ be the time allocaced to student i* Then the goal is to select 
a vector (t^, t2,..»jtj^) that optimizes learning, To do this let us check 
our four criteria for deriving an optimal strategy. 

The first criterion Is that vre have a model of the learning process. 
The prediction equation for Pj^(t) does not offer a very complete account 
of learning; however, for purposes of this problem the equation suffices as 
a model of the learning process * giving all of the information chat is 
required. This is an important point to keep in mind: the nature of the 
specific optimisation problem determines the level of complexity that must 
be represented in the learning model* For some problems the mcdei must 
provide a relatively completa account of learning in order to derive an 
optimal strategy, but for other problems a simple descriptive equation ol 
the sort presented above will suffice, 

The second criterion requires that the set of .admissible instructional 

actions be specified. For the present case the potential actions are simply 

all posslbia vectors (t^i t25*i.,t^) such that the t^*s are non-negative 

and sum to T, The only freedom we have as decision makers in this situation 

is in the allocation of CAI time to individual students, 

■ 

The third criterion requires that the instructional objectiva be 
Specified, There are several objectives that we could choose in this 
situation. Let us consider four possibilities i 

(a) Maximize the mean value of P over the class of students. 
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. (b) MinimlEe the variance of P over the claas of students. 

(c) Maximize the number of students who score at grade level at the 
end of the first year, 

(d) Maximize the mean value of P satisfying the constraint that the 
resulting variance of P is less than or equal to the variance 
that would have been obtained if no CAI was administered. 

Objective (a) maximizes the gain for the class as a whole; (b) aims to 
reduce differences among students by making the class as homogeneous as 
posiiblei (c) is concerned specifically with chose students that fall 
behind grade level; (d) attempts to maximize performance of the whole 
class but insures that differences among students are not amplified by 
CAI* Other instructional objectives can be listed, but these are= the oneg 
that seemed most relevent* For expository purposes, let us select (a) as 
the Instructional objective. 

The fourth criterion requires that costs be assigned to each of the 
instructional actions and that payoffs be specified for the instructional 
objectives. In the present case we assume that the cost of CAI does not 
depend on how time is allocated among students and that the meaauremant 
of payoff is directly propdrtional to the students' achieved value of P. 

In terms of our four criteria^ the problem of deriving an optimal 
instructional strategy reduces to maximizing the function 

n_ 
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subject to the constraint chat 
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This maKimization can be dune by using the method of dynamic programniing 
(Bellman^ 1961)* In order co illustrate the approach j computations were 
made for a first-grade class where the parameters a, and y had been 
estimated for each student* Employing these estimates , computationB were 
carried out to determine the time allQcations that maximiEed the above equa= 
tion. For the optimal policy the predicted mean performance level of the 
class, was 15% higher than a policy thac allocated time equally lo ntxidanLt: 
(i,e*, a policy where ^ tj for all x and j). This gain repriisciu ^ 
sCantial improvement; the drawback is uhat che variance ut Uia F ;?curtjs it. 
roughly 15% greater than for ih& equal- time policy* Tnis msanii tiidu if v/;:^ 
are interested primarily in raising the class average , we must lat the rapid 
learners move ahead and progress far beyond the slow learners* 

Although a time allocation that complies with objective (a) did increaA^i^ 
overall class performance, the correlated increase in variance leads us 
to believe that other objectives might be more beneficial. For comparison^ 
time allocations also were computed for objectives (b)j (c) , and (d) * Figure 
praseata the predicted gain in P as a percentage of P for the equai-timG 
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Insert Figure 1 about here 



policy I Objectives (b) and (c) yield negative galas and so thty whouJd bince 
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Figure li Percent gains in the mean value of P whan compared wich an 

equal«time policy foi* fouir policies each based on a dlffftrent 
instructional dbjectlve. 



Atkinson 11 

their goal is to reduce variability, which is accomplished by holding 
back on the rapid learners and giving a lot of attention to the slower 
ones* The reduction in variability for these two object ives, when compared 
with the equal-time policy ^ is 12% and 10%, respectively. Objective (d) , 
which attempts to strike a balance between objective (a) on the one hand 
and objectives (b) and (c) on the other, yields an 8% increase in P and 
yet reduces variability by 6%* 

In view of these computations | objective (d) seems to be preferred; it 
offers a substantial increase in mean performance while maintaining a low 
level of variability i As yet, we have not implraented this policy , so 
only theoretical results can be reported. Nevertheless, these examples 
yield differences that illustrate the usefulness of this type of analysis* 
They make it clear that the selection of an instructional objective should 
not be done in isolations but should involve a comparative analysis of 
several alternatives taking into account more than one dimension of per^ 
fomancei For example ^ even if the principal goal is to maximize P, it 
would be Inappropriate In most educational situations to select a given 
objective over iome other if it yielded only a amall average gain while 
variability mushroomed. 

Techniques of the sort presented above have been developed for other 
aspects of the CAI reading program* One of particular interest involves 
deciding for each itudent, on a week*by-week basis, how time ihould be 
divided between training in phonics and in sight-word identification 
(Chant & Atkinson, 1972), However ^ these developments will not be con-^ 
sidered here; it will be more useful to turn to another example of a quite 
dlfferint type. 
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OPTIMIZING THE LEARNING OF A SECOND^LANGUAGE VOCABULARY 
The second example deals with learning a f oreign'-language vocabulary, 
A similar example could be given from our work in initial readingj but 
this particular eKample has the advantage of permitting us to introduce 
the concept of learner=contf oiled instruction. In developing the example 
we will consider first some experimental work comparing three instructional 
strategies and only later explain the derivation of the optimal strategy*^ 

The goal is to individualize instruction so that the^ learning of a 
second'-language vocabulary occurs at a maximum rate. The constraints imposed 
on the task are typical of a school situation, A large set of German-English 
items are to be learned during an Instructional session that involves a 
series of trials* On each trial CPne of the German words is presented and 
the student attempts to give the English translation; the correct trans^ 
latlon is then presented for a brief study period* A predetermined number 
of trials Is allocated for the instructional session, and after an interve.nin 
period of one week a test is administered over the entire vocabulary. The 
optimiEation problem is to formulate a strategy for presenting items during 
the instructional session so that performance on the delayed test will be 
maximized. 

Three strategies for sequencing the instructional material will be 
considered. One strategy (designated the random-order strategy) is simply tc 
cycle through the set of items in a random , order * this strategy is not 
expected to be particularly effective but it provides a benchmark against 
which to evaluate others, A second strategy (designated the learner- 
controlled strategy) is to let the student determine for hijnself how best 
to sequence the material* T=n this mode the student decides on each trial 
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which item is to be tested and studied; the learner rather than an external 
controller determinee the sequence of instruction. The third scheme 
(designated the response-»sensitive strategy) is based on a decisions-theoretic 
analysis of the ins tructional task* A mathematical model of learning that 
has provided an accurate account of vocabulary acquisition in other exparl^ 
meats is assumed to hold in the present situation. This model is used to 
compute^ on a trial^by^ trial basis, an individual student -s current state of 
learning. Based on these computations j items are selected from trial to 
trial so as to optimize the level of learning achieved at the termination 
of the instructional session. The details of this strategy are complicated 
and can be more meaningfully discussed after the experimental procedure 
and -results have been presented. 

Instruction in this experinient Is carried out under computer contrcJ . 
The students are required to participate in two sessions: an instruct ion a I 
session of approximately two hours and a briefer delayed- tes t session 
administered one week later. The delayed test is the same for all students 
and involves a tost over the entire vocabulary. The Instructional session 
Is more complicated. The voeabulary Items are divided into seven lists 
each containing twelve German words; the lists are arranged in a round-robin 
order (see Figure 2), On each trial of the Instructional session a list is 

Insert Figure 2 about here 



displayed and the student inspects it for a brief period of time. Then one 
of the items on the displayed list is selected for test and study. In the 
random-order and response«sensltive conditions the item is selected by the- 
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computer. In the learner-controlled condition the item is chosen by the 
student. After an itan has bean selected for tests the student attempts 
to provide a translation; then feedback regarding the correct cranslation 
is givens The next trial begins with the computer displaying the next list 
in the round-robin and the same procedure is repeated. The instructional 
session continues in this fashion for 336 trials (see Figure 3). 

Insert Figure 3 about here 

The results of the experiment are summarized in Figure 4, Data are 

Insert Figure 4 about here 



presented on the left side of the figure for performance on successive 
blocks of trials during the instructional session; on the right side are 
results from the test session administered one week after the instructional 
seision* Note that during the instructional session the probability of a 
correct responie is highest for the random=order conditionj next highest for 
the learner-controlled condition, and lowest for the response^sensitlve 
condltiont The results ^ however, are reversed on the delayed test. The 
reeponse-sensitive condition is best by far with 79% correct; the learner- 
controlled condition is next with 58%; and ^the random-order condition is 
poorest at 38Xi The obierved pattern of results is expected- In the 
learner-controlled condition the students are trying, during the instructioaa 
session, to test and study those items they do not know i should have a 
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figure 3 1 Flow chart describing the trial sequence during the Instructional 
session* The ielectlon of a word for teit on a given trial 
(box with heavy border) varied over experimental conditions. 
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Figure 4i Propdrtion of correct responses in succesiive trial blocks 
during the instructional session^ and on the delayed test 



admlniitered one week later. 
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lower scpre than students in the random-order condition where testing is 
random and includes many items already tnas tared* The response^senait Ive 
procedure also attempts to Identify for test and study those items that 
have not yet^ean mastered and thus also produces a high error rate during 
the Instructional session. The ordering of groups on the delayed test is 
reversed since now the entire set of words is tested; when all items are 
tested the probability of a correct response tells us how much of the 
list actually has been mastered. The magnitude of the effects observed on 
the delayed test are large and of practical significance. 

Now that the effectiveness of the response^sensitive strategy has been 
eatablishedj let us turn to a discussion of how it was derived. The strategy 
is based on a model of vocabulary learning that has been investigated 
in the laboratory and shown to be quite accurate (Atkinson & Crothers, 1964; 
Atkinson I 1972), The model assumes that a given, item is in one of three 
states (Pj Tj and U) at any moment in time. If the item is in state P then 
its translation is known and this knowledge is "relatively'^ permanent in 
the sense that the learning of other vocabulary icema will not interfere 
with itp If the item is in state T then it Is also known ^ but on a 
"temporary'* basis; in state T other items can give rise to interference 
effects that cause the itan to be forgotten. In state U the item is not 
known and the student is unable to provide a translation. Thus in states P 
and T a correct translation is given with probability onm^ whereas in 
state U the probability is zero* 

Wien a test and study occurs on a given item the following transition 
matrix describes the possible change In state from the onset of the trial 
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to Its termlnationj 
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Rows of the matrlK represent the state of the Iceni at the start of the trial 
and columns its state at the end of the trial. On a trial when some other 
item is presented for test and study, a transition in the learning state of 
our original item also may take place; namely, forgetting is possible in 
the sense that if the item is in state T It may transit into state U* 
This forgetting can occur only if the student makes an error on the other 
item; in that case the transTtton matriK applied to th^t original itein Is 
as follows: 

P T U 

P 1 0 0 
F ^ T 0 1-f f 
U 0 0 1^ 

To summarigej consider the application of matrj.ces A and F to some specific 
item on the list; when the iten itself is presented for test and study 
transition matriK A Is applied; when some other item is presented that 
elicits an error then matrix F is applied. The above assumptions provlrje 
a complete account of the learning process. The parameters in matrices 
A and F measure the difficulty level of a German-English pair and vary 
across Items, On the basis of prior experiments, numerical estimates of 
these parameters exist for each of the items used in the experiment. 
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As ndttd eaiMit^r, iri- lormulaLion oi n )^f:rai:ta'>^ r^iMtui rt^c; t:hcit w ho. 
precise about the quanclty Co be maKimUed. For fhe present Cask, t\w. goal 
is tp nuixiiniF:g the nunihi-r vt iven'f, rntr^fcv]v traurlatcv on the dej..iyGd uent . 
To do this, a rhauretlcal relationship must bt^ spt^cifit^d between t:he stiate 
□f learning at, th^ end of *:h& in^ trucLlona : scissior and perf ormancc^ cm 
the delayod test. Tht J^sumptiDn mrirjp: !u*i r in ^hat c:ilv thosn 1 rnms in 
state P at the and of the InfitructJonal AaBslun will be translatBd correctly 
on the delayad test; an item In state T is presumttd to be fprgotten during 
the intervening week. Thus, the problem of maKimi^nng delayed--t€^st perfor- 
mance InvQjveB, at least in thHory, maKlmiKing the nntnhfT of itcjms In state 
P at the termlnatiDn of the Inst ructf dhaI ?^^sfiinn. 

Having numerical values for parameterR nnd knowing the student'^ resDons 
historvi it is pns^iilbl^ t '^it-n^AuJ' his rurr^M^^ i^n, -tT i-Hr-;:-- ^ ' -i 

more prec Uely , the learnih^ rnoclnl *^an h^, ust^d to dnrivft ^ .^M^jtlnn?^ r*nd , I - 
turn^ comp»*tc th^ prob^b i ) j * i c^s^ ct being in >st^»;r^s , 'V :r^d ij K^pri\ irrm 

at the start of trial n, condi f icna 1 3 .^^d on thri Ptudent'L^ zmpc^n-i- hisrcrv 
up to and including trial n-^l. Given nuinerlcal estimates of theae probabilM. 
a strategy for optimising per fonnaniic Is to seiect that item for pres^f.ca t r-^n 
(from the current display lisL) that has ihe greatest puohnbllUy of m:^.r ink 
Into state P if it is tested and studied on the ^:rTal. This strataf^v hag 
been termed the one-'Stage optimization prDcadure becauae it lookrs ahead one 
trial In making decisions. The true optitral policy (i*a,, an N--?^t>u.^ --^ 
cedure) would consider all possible Item-response ^^quencM rnr the rm^^rii- 
trials and select the next it™ so as to maximize the number of items in 
State P at the tanuJnanion of the InHtructlDns J le^^lor*. For tin p-^K^rii 
cas© the N-stage policy cannot be applied bacausB tha n^cms^/^^' cDmpuLat iani^ 



are too time consuining evvin ior ji in^ft cr^h'vntisV . Vuv inui^Lidlv , MoiUu Cairlc 
studies Indicate chat tli^ cm-stBg& puilcy a good d pprox Imau to uui 
optimal strategy for a va* iet:y jj. i^jikcv/ li^arruiu nud%M^ ; it wns ; .»i thiH 
reason, as well as the relativi^ ea^^^ 'M' cuinpuiiiig, linai Khe oue-Bt^/^iS pro- 
cedure was empioyed.'' Th^ compuLatic oaj pi'^a-edure a^ricrlbed abuv-^ wa^ 
implemented on th^f compuce^r and do-liui Ltfcd atici-lMnH ' he madi jn-linf^ 
for each student on a triril-by-tr lal basis. 

The rfcsponse-stsnsicivt^ strategy undoubtedly can bB improved upon by 
elaborating the learning model. Those familiar with developm*3nts .in learnl' 
theory will see a nuuiber ot way^ or intrQduciiu> more compiexiny into the 
model and thereby increa^inu ilb precision. Wt^ wiLI nor pursue ^uich con-- 
slderations herej howevdr, since our reason tor preBencin^ the gK^ainple was 
not to theorists aiijout J ^ . " uu a-: bur vis^^K-^ ; - ,rt,j ; ^ r ^, 

simple learning model can be used f nefin^ innivucii ma i or ^^-ra.rt^. 

CONCJ.UDING REMARKS - 
HopiSfully, theac^ cvo ^rXdmT)}.-,\ i'h[^tv. i:e ih^ H-^=r-i invOlv^ui in av^'^^i. , 
ling an opclnMl sLrar^jgy r^a^ ins true i: hoib nx^jmplii-; 0^::/:L wir}, r l=i1/h : j.v^- ■ 

simple prob.-.-m/4 dnd ihuj^ noL i,;J.;cj= : i?e v - 1 c::'i -^rM , -Ir^'. 

have been made or that: ars clearly uOBsiMe. In ^■'ouid he a mi taK^? , bo;./ci*v;^i. 
to conclude that this approach offers a Hnlutioii to tni problems racing 
education. There are sonie f undainiintal cbi.U^c.i.es Lhac Uml!: the cruniivra j nv 
of the work* 

The major obstaclti^^ iimy bi^; td(?:'U i f 1-id In L^^ruuj ^.f thi^ ica.-: ■ ' t :.x ' 'X 
we specified as prferequislr^s lor dn opi/im^il .;l;;atfjgy= Th^ tlrBl c;/.U.^=/ i 
concerns the formulatioa of leriiLiiu^; hioc^^lu. Thcr* mgdols that, now cmu..,. 
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totaiiy inadequate to txplain Lhu HuDcle Weiys by whicli Lh^ human organism 



stores a processes s and retrieves intormat ion » Until havy a much deeper 



understanding of learning ^ the i.d en L i f; ica tion at cruly e 



f t; e c t i V e y t r a t eg i es 



will not be possibie. However, 



an all=-inclusive theory of learning is not a 



prerequisite for the develnpmenC of optimal procedures. Vvhat is nt?eded 
instead is a model that captures the essentLal featureB of that: part of the 
learning process being tapped by a given inscruc tional task. Even models 
that may be rejected on the basis of laboracory investigation can be usefui 
in deriving instructional strategies. The two learning iimdels considered In 
this paper are extremely simple^ and yet the optimal strategies they generato 
are quite effective. My own preference is to formulate as complete a 
learning modei as intuition and data will permit and then use that model to 
InvestlgHte optiriiai pcoued^J i en : when prt-ii^ifh^e thu learning rnodt^l will h-: 
represantad in the forni of maLhematlcal equations but: utiharwlH'i; as a sc^K 
of statements in a computer^-aimulation prgKrain^ Ihe main point is that the 
development of a theory of instruction cannot progress it one holds the vjev 
that a complete theory of learning im a prerequiaitei Ratherj advances in 
learning theory will affect the developiaunt of a theory of instruction, and 
conversely the development of a theory of Instruction will influence r as ear rh 
on learning* 

The second criterion for deriving an opcimal strategy requires that 
adnilssible instructional actions be clearly specified* The set ot pntsitf 
instructional inputs places a definite limit on the effectiveness of cha 
optima-l strategy. In my opinion powerful instructional strategies must 
necessarily be adaptive; that is, they must be sensitive on a momenc--to-' 
moment basis to a learner's unique response history. My judgment on this 
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matter is based nu limitf^d expaTievce. r&i?f:r i r n^'d prlm^irflv tn r^^.^Qarnh on 
teaching initial reading. In this area, how^ever , th& evLdence ^^eems to be 
absolutely clear: the rnani pulati on of method vsriablo^i afcnnntn for onlv 
a small percentage of the variance when not accorip;^ni Hv InBtnictional 
strategies that permU ind i.vldMalizatlon . Method varjables like rhe modifif?d 
teaching alphabats oral i^eading, ^.ho IJngulntic appro^ich, and ochers 
undoubtedly have beneficial effecrs* However, theaa effects are minimal 
in comparison to the impact that is possible when instruction ie adaptive 
to the individual learner. Significant progress in deaJlng with the nation 
problem of teaching reading will require Ind LvidMal ly prmMcrihad programs, 
and sophisticated programs ^ill necfS^Bitate s?am^ d^gr^?^^ of computer inter- 
vention either in the form of CAI or computer-managed instruction. As a 
corollary to zhtn poinii , .1^ li^ t^viden?: f^om oh 

CAI Reading ProgrAm that rhe mor'^ ^:if f enf iv^ nhj^ ad?^ptlve r'trafi;^!^v uhi? 1^;- ^ 
important are eKtrinslc mucivators* Mortvation Ir a variabJ.^ In any for^ or 
learning, hut when the instructional proce^ss is t-ruly adaptive thp. firndenv'^^ 
progress is sufficent reward in its own right. 

The third criterion for an optimal strategy deal« with Instruc ti onai 
objectives^ and the fourth with cost-benefit mt^ayurcs* In i:he ana3vsen 
presented here^ it was tacitly assumed that the curriculum niaterial beung 
taught is sufficiently beneficial to justify allocating time to it. Further, 
in both exampleR the costs of instruction ver^ assumed to ha thti fim^e t or 
all strmtegies* If the costs of inBtruction ar^ equal for all ^tratngi^'S , 
they may be ignored and attention focused on the comparative benmf^w of tn 
strategies. This is an Important point because It greatly F;implif i^:S thp 
analysis. If both costs and benefits are significant variables, thi?n it 
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Is essential that both be accuratfeiy esclmatad. ThiH \^ nfton JilsMcult to 
do* When one of these quantities can be ignored , It suffices if the other 
can be agsessed accuraCaly enough to order the possibiiM outcome.^, As a 
rulej both costs and benefits must he weighed In the analysis, and fre- 
quently subtopics within a curriculum vary si gn i f lean t Ty in their importance. 
In some cases * whether or not a certain topic should be taught at all is the 
critical question, Smallwood (1971). has treated probleTns similar to the 
ones considered In this article in a way that includes some of these factors 
in the scructure of costs and benefits. 

My last reiiiarks deal with the issue of le^irner-control led instructton. 
One way to avoid the challenge and responsibilj ty of developing o theory of 
instruction is to adopt the vlev; that the learner is thc^ best judge of what 
to study^ whf:n to study, and hiiw to sttidv . I atii Hl.f\rnv^d hv t:i;p iM^mber of 
Individuals who advocate this position d^^^finlte a Ereat (ieal of nBgritive 
evidence. Don't misinterpret this remark. There ohviouaiy Is ^ placj^ for 
the learner's Judgments in making inBtruc tional doclslons. In j^wc^fiI CA] 
programs that I have helped develops the Itimrner plays an iiDportsnt role in 
deteriiilnlng the path to be followed through the curr: -am* However, using 
the learner-s judgment as one of several items of inlurinatlon in making .jn 
instructional decision is quite different from proposing that tha Ic^arner 
should have complete control. Our data, and the data of others ^ indicate 
that the learner Is not a particularly effecciue decision maker. Ar KVin-^^n, 
against learner-controlled programs are unpopular In the nrijsent cllmar^^ 
opinion s but they need to be made so that we will not be seduced b" the 
easy answer that a theory of Instruction Ln not -reonirc^d bec^usr^, ^S-^hn en = 
ERJjC a better judge of wiiat is best for th^ student than the studc^nt hiTfiPslf," 
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The aim of tbi£3 paper wjs to ilj^in; tr^itu the ^Ue^pf^ lnvo]yad in /riving 
an optimal sLrategy and their implicaticns xox a ih^or\ of in.st rucLian, t 
want to mphaslze a point: mad a CIim oue^^ t^t;----nam^.lv tA^ni t he H!:--rn^iC!^ is 
only one of many that needB to by puiirjued. Obvicuslv the main obstacle 



that we most want to optimizfi. However^ an the exfimplos indicate, analyses 
based on highly sinipiified models can bs useful in idenuifying problyins and 
focusing research efforts. It seeins clear that this typG of retitiarch is a. 
necessary component in a program designed to develop a general thi^^^orv of 
Instruction^ 



is that adequate theories as yet do not 



t i ur thp. .1 oarnir 



g proes^s ^es 
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Gagna (1970), Saidel and Hunter (1970), Pask and Scott (1971), and 
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For a morte extensive disctu^sian of f^omB oS theaa poinna so^ A^kxa.^:jfi st^d 
Paulson (1972), Calfee (1970), Dear, «t ai. (1967), Laubsch (i970> , and 
Smallwood (1971). 

^A detailed account of this research can be found In Ackinson (1972). 

The student's res pons e history is a record (for aach trial) of the item 
presented and the response that occurrad, It can be shown that a sufficient' 
hiatory exists which concains only the information necessary to estimate 
the student's current state of learning; the sufficlsnt history isi .u functtDn 
of the co^leta history and the assumed learning aaodal. For th^ mcriel 
eonsidered here the sufficient history is fairly simple, but cannon be 
readily described without extensive notation. 
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For a dlscufsion of one-stage and iN-'Stage policies and Honte CarJ.o studies 
comparing themp see Groen and Atkinson (1966), Calfee (1970), and Laubsch 
(1970) . 
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